METHOD AND SYSTEM FOR FINDING SIMILAR RECORDS IN MIXED
FREE-TEXT AND STRUCTURED DATA
ABSTRACT OF THE DISCLOSURE

A technique for data mining in which the available data contains both structured and
unstructured (free-text) data. The present invention combines the information
available from different types of data to provide a single similarity score indicating the
degree of similarity between records. Thus, a data evaluation application selects two
records from a database and compares corresponding fields from the two records. The
application determines whether to apply a nominal matching process, an ordinal matching
process, or a vector-space matching process depending on the type of data in each pair of
corresponding fields. The application sums the matching scores for all the fields in the
records to compute the similarity score.
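
The per-field dispatch and summation described above can be illustrated with a minimal sketch. The abstract names only the three matching processes and the final sum; the specific scoring rules used here (exact equality for nominal fields, range-normalized distance for ordinal fields, cosine similarity over raw term counts for free text) are assumptions chosen for illustration, as are all function names, the schema layout, and the sample records.

```python
import math
from collections import Counter


def nominal_match(a, b):
    # Assumed rule: nominal values either match exactly or not at all.
    return 1.0 if a == b else 0.0


def ordinal_match(a, b, lo, hi):
    # Assumed rule: closeness on an ordered scale, normalized by its range.
    if hi == lo:
        return 1.0
    return 1.0 - abs(a - b) / (hi - lo)


def vector_space_match(text_a, text_b):
    # Assumed rule: cosine similarity between raw term-count vectors.
    va = Counter(text_a.lower().split())
    vb = Counter(text_b.lower().split())
    dot = sum(va[t] * vb[t] for t in va)
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


def similarity(rec_a, rec_b, schema):
    # Pick a matching process per field type, then sum the field scores.
    total = 0.0
    for field, spec in schema.items():
        x, y = rec_a[field], rec_b[field]
        kind = spec["kind"]
        if kind == "nominal":
            total += nominal_match(x, y)
        elif kind == "ordinal":
            total += ordinal_match(x, y, spec["lo"], spec["hi"])
        elif kind == "text":
            total += vector_space_match(x, y)
        else:
            raise ValueError(f"unknown field kind: {kind}")
    return total


# Hypothetical schema and records, for illustration only.
SCHEMA = {
    "make": {"kind": "nominal"},
    "severity": {"kind": "ordinal", "lo": 1, "hi": 5},
    "notes": {"kind": "text"},
}
REC_A = {"make": "acme", "severity": 2, "notes": "engine stalls when cold"}
REC_B = {"make": "acme", "severity": 4, "notes": "engine stalls when hot"}

if __name__ == "__main__":
    print(similarity(REC_A, REC_B, SCHEMA))
```

In this sketch each field contributes a score between 0 and 1, so the summed similarity is bounded by the number of fields; a real system might weight fields differently or use a different term-weighting scheme for the free-text comparison.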


